Subpolynomial trace reconstruction for random strings and arbitrary deletion probability

Authors

  • Nina Holden
  • Robin Pemantle
  • Yuval Peres
Abstract

The deletion-insertion channel takes as input a bit string x ∈ {0, 1}^n and outputs a string in which bits have been deleted and inserted independently at random. The trace reconstruction problem is to recover x from many independent outputs (called “traces”) of the deletion-insertion channel applied to x. We show that if x is chosen uniformly at random, then exp(O(log^{1/3} n)) traces suffice to reconstruct x with high probability. The earlier upper bounds were exp(O(log^{1/2} n)) for the deletion channel with deletion probability less than 1/2, and exp(O(n^{1/3})) for the general case. A key ingredient in our proof is a two-step alignment procedure in which we estimate the location in each trace corresponding to a given bit of x. The alignment is done by viewing the strings as random walks and comparing the increments of the walk associated with the input string with those of the walk associated with the trace.
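To make the setup concrete, the following is a minimal simulation sketch, not the authors' procedure. The abstract only says that bits are deleted and inserted independently at random, so the specific model here is an assumption: before each input bit a geometric number of uniform random bits is inserted (continuing with a hypothetical probability `q_ins`), and the input bit itself is then deleted with a hypothetical probability `q`. The helper `walk` shows one natural way to view a bit string as a random walk, with a 1 as a step up and a 0 as a step down.

```python
import random


def insertion_deletion_channel(x, q, q_ins, rng):
    """Return one trace of the bit list x.

    Assumed model (q and q_ins are illustrative parameters, not taken
    from the abstract): before each input bit, keep inserting a uniform
    random bit while a coin with success probability q_ins comes up
    heads; then delete the input bit with probability q.
    """
    trace = []
    for bit in x:
        while rng.random() < q_ins:      # geometric number of insertions
            trace.append(rng.randint(0, 1))
        if rng.random() >= q:            # bit survives with probability 1 - q
            trace.append(bit)
    return trace


def walk(bits):
    """Partial sums of the +1/-1 steps encoded by bits (1 is up, 0 is down)."""
    position = 0
    path = [0]
    for b in bits:
        position += 1 if b else -1
        path.append(position)
    return path


if __name__ == "__main__":
    rng = random.Random(0)
    x = [rng.randint(0, 1) for _ in range(20)]   # uniformly random input string
    traces = [insertion_deletion_channel(x, 0.3, 0.1, rng) for _ in range(5)]
    print("input walk:", walk(x))
    for t in traces:
        print("trace walk:", walk(t))
```

Comparing the increments of `walk(x)` over a window with the increments of `walk(trace)` over candidate windows is the kind of comparison the alignment step relies on; the actual two-step procedure and its thresholds are in the paper and are not reproduced here.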

Similar resources

Study of Random Biased d-ary Tries Model

Tries are the most popular data structure on strings. We can construct d-ary tries from strings over an alphabet of size d. Throughout the paper we assume that the strings stored in the trie are generated by an appropriate memoryless source. In this paper, with a special combinatorial approach we extend their analysis for average profiles to d-ary tries. We use this combinatorial appr...

Trace Reconstruction Revisited

The trace reconstruction problem is to reconstruct a string x of length n given m random subsequences where each subsequence is generated by deleting each character of x independently with probability p. Two natural questions are a) how large must m be as a function of n and p such that reconstruction is possible with high probability and b) how can this reconstruction be performed efficiently....

Trace reconstruction with varying deletion probabilities

In the trace reconstruction problem an unknown string x = (x_0, ..., x_{n−1}) ∈ {0, 1, ..., m − 1}^n is observed through the deletion channel, which deletes each x_k with a certain probability, yielding a contracted string X̃. Earlier works have proved that if each x_k is deleted with the same probability q ∈ [0, 1), then exp(O(n^{1/3})) independent copies of the contracted string X̃ suffice to reconstruct x...

Symbolic Channel Modelling for Noisy Channels Which Permit Arbitrary Noise Distributions

In this paper we present a new model for noisy channels which permit arbitrarily distributed substitution, deletion and insertion errors. Apart from its straightforward applications in string generation and recognition, the model also has potential applications in speech and unidimensional signal processing. The model is specified in terms of a noisy string generation technique. Let A be any fi...

Second Moment of Queue Size with Stationary Arrival Processes and Arbitrary Queue Discipline

In this paper we consider a queuing system in which the service times of customers are independent and identically distributed random variables, the arrival process is stationary and has the property of orderliness, and the queue discipline is arbitrary. For this queuing system we obtain the steady state second moment of the queue size in terms of the stationary waiting time distribution of a s...

Journal:
  • CoRR

Volume: abs/1801.04783

Publication year: 2018